Using Triangle Inequality to Efficiently Process Continuous Queries on High-Dimensional Streaming Time Series
نویسندگان
چکیده
In many applications, it is important to quickly find, from a database of patterns, the nearest neighbors of highdimensional query points that come into the system in a streaming form. Treating each query point as a separate one is inefficient. Consecutive query points are often neighbors in the high-dimensional space, and intermediate results in the processing of one query should help the processing of the next. This paper extends the KD tree with triangle inequality to deal with high-dimensional streaming time series. More specifically, the distances calculated for earlier query points (to patterns) are used to filter out patterns that are not possible to be the nearest neighbor of the current one. Experiments show that this extension works well.
منابع مشابه
Online Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features
Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...
متن کاملSimilarity Search on Bregman Divergence: Towards Non-Metric Indexing
In this paper, we examine the problem of indexing over non-metric distance functions. In particular, we focus on a general class of distance functions, namely Bregman Divergence [6], to support nearest neighbor and range queries. Distance functions such as KL-divergence and Itakura-Saito distance, are special cases of Bregman divergence, with wide applications in statistics, speech recognition ...
متن کاملارائه روشی پویا جهت پاسخ به پرسوجوهای پیوسته تجمّعی اقتضایی
Data Streams are infinite, fast, time-stamp data elements which are received explosively. Generally, these elements need to be processed in an online, real-time way. So, algorithms to process data streams and answer queries on these streams are mostly one-pass. The execution of such algorithms has some challenges such as memory limitation, scheduling, and accuracy of answers. They will be more ...
متن کاملProcessing Spatio-Temporal Queries in a Streaming Fashion
In the past few years, the wide application of online map applications and location based services have significantly changed the ways of our life. Today, it is typical for us to search for directions or businesses on the Internet and find our ways through cell phones. Lots of the location based services require to provide answers to users continuously so that the users can interact with the ap...
متن کاملProcessing Spatio-Temporal Queries through a Streaming Fashion
In the past few years, the wide application of online map applications and location based services have significantly changed the ways of our life. Today, it is typical for us to search for directions or businesses on the Internet and find our ways through cell phones. Lots of the location based services require to provide answers to users continuously so that the users can interact with the ap...
متن کامل